Overview

Dataset statistics

Number of variables13
Number of observations92872
Missing cells27053
Missing cells (%)2.2%
Duplicate rows27
Duplicate rows (%)< 0.1%
Total size in memory9.2 MiB
Average record size in memory104.0 B

Variable types

Categorical1
Numeric12

Warnings

Dataset has 27 (< 0.1%) duplicate rowsDuplicates
Liquidity is highly correlated with Return on EquityHigh correlation
Return on Equity is highly correlated with LiquidityHigh correlation
EPS is highly correlated with Profitability and 3 other fieldsHigh correlation
Profitability is highly correlated with EPS and 3 other fieldsHigh correlation
Productivity is highly correlated with EPS and 3 other fieldsHigh correlation
Operational Margin is highly correlated with EPS and 3 other fieldsHigh correlation
Return on Equity is highly correlated with EPS and 3 other fieldsHigh correlation
Assets Growth is highly correlated with Employee GrowthHigh correlation
Employee Growth is highly correlated with Assets GrowthHigh correlation
EPS is highly correlated with Productivity and 1 other fieldsHigh correlation
Profitability is highly correlated with ProductivityHigh correlation
Productivity is highly correlated with EPS and 3 other fieldsHigh correlation
Operational Margin is highly correlated with ProductivityHigh correlation
Return on Equity is highly correlated with EPS and 1 other fieldsHigh correlation
Return on Equity is highly correlated with EPSHigh correlation
Liquidity is highly correlated with ProfitabilityHigh correlation
Profitability is highly correlated with LiquidityHigh correlation
EPS is highly correlated with Return on EquityHigh correlation
Operational Margin has 5557 (6.0%) missing values Missing
Assets Growth has 6701 (7.2%) missing values Missing
Sales Growth has 6701 (7.2%) missing values Missing
Employee Growth has 7010 (7.5%) missing values Missing
EPS is highly skewed (γ1 = -137.6928255) Skewed
Liquidity is highly skewed (γ1 = -138.2539813) Skewed
Profitability is highly skewed (γ1 = -50.43091391) Skewed
Productivity is highly skewed (γ1 = -80.9125844) Skewed
Leverage Ratio is highly skewed (γ1 = 290.9228157) Skewed
Asset Turnover is highly skewed (γ1 = 70.7239712) Skewed
Operational Margin is highly skewed (γ1 = -81.07270041) Skewed
Return on Equity is highly skewed (γ1 = -165.8537632) Skewed
Assets Growth is highly skewed (γ1 = 137.2063306) Skewed
Sales Growth is highly skewed (γ1 = 191.0078837) Skewed
Employee Growth is highly skewed (γ1 = 128.4063963) Skewed
EPS has 1833 (2.0%) zeros Zeros
Liquidity has 1706 (1.8%) zeros Zeros
Productivity has 1581 (1.7%) zeros Zeros
Leverage Ratio has 17809 (19.2%) zeros Zeros
Asset Turnover has 5907 (6.4%) zeros Zeros
Operational Margin has 1360 (1.5%) zeros Zeros
Return on Equity has 2355 (2.5%) zeros Zeros
Sales Growth has 5018 (5.4%) zeros Zeros
Employee Growth has 7394 (8.0%) zeros Zeros

Reproduction

Analysis started2021-09-12 15:31:45.011693
Analysis finished2021-09-12 15:32:44.510135
Duration59.5 seconds
Software versionpandas-profiling v3.0.0
Download configurationconfig.json

Variables

BK
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size5.1 MiB
0
92314 
1
 
558

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters92872
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
092314
99.4%
1558
 
0.6%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
092314
99.4%
1558
 
0.6%

Most occurring characters

ValueCountFrequency (%)
092314
99.4%
1558
 
0.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number92872
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
092314
99.4%
1558
 
0.6%

Most occurring scripts

ValueCountFrequency (%)
Common92872
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
092314
99.4%
1558
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII92872
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
092314
99.4%
1558
 
0.6%

EPS
Real number (ℝ)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED
ZEROS

Distinct5161
Distinct (%)5.6%
Missing5
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean-14.46135521
Minimum-384000
Maximum55339
Zeros1833
Zeros (%)2.0%
Negative33992
Negative (%)36.6%
Memory size725.7 KiB

Quantile statistics

Minimum-384000
5-th percentile-1.85
Q1-0.14
median0.33
Q31.53
95-th percentile4.3
Maximum55339
Range439339
Interquartile range (IQR)1.67

Descriptive statistics

Standard deviation2195.467288
Coefficient of variation (CV)-151.8161511
Kurtosis20817.46
Mean-14.46135521
Median Absolute Deviation (MAD)0.749
Skewness-137.6928255
Sum-1342982.674
Variance4820076.611
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
01833
 
2.0%
-0.011593
 
1.7%
-0.021269
 
1.4%
-0.031014
 
1.1%
-0.04906
 
1.0%
-0.05839
 
0.9%
-0.06722
 
0.8%
-0.07654
 
0.7%
0.01652
 
0.7%
-0.09565
 
0.6%
Other values (5151)82820
89.2%
ValueCountFrequency (%)
-3840001
< 0.1%
-3218981
< 0.1%
-3148931
< 0.1%
-2050001
< 0.1%
-1523221
< 0.1%
-129042.51
< 0.1%
-482001
< 0.1%
-424191
< 0.1%
-418041
< 0.1%
-112131
< 0.1%
ValueCountFrequency (%)
553391
< 0.1%
512771
< 0.1%
421761
< 0.1%
285201
< 0.1%
280001
< 0.1%
147251
< 0.1%
134161
< 0.1%
52091
< 0.1%
15431
< 0.1%
1066.141
< 0.1%

Liquidity
Real number (ℝ)

HIGH CORRELATION
HIGH CORRELATION
SKEWED
ZEROS

Distinct3114
Distinct (%)3.4%
Missing247
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean-2.631237398
Minimum-25968.52
Maximum1
Zeros1706
Zeros (%)1.8%
Negative18343
Negative (%)19.8%
Memory size725.7 KiB

Quantile statistics

Minimum-25968.52
5-th percentile-0.48
Q10.02
median0.19
Q30.4
95-th percentile0.73
Maximum1
Range25969.52
Interquartile range (IQR)0.38

Descriptive statistics

Standard deviation121.6109207
Coefficient of variation (CV)-46.21814846
Kurtosis25278.28839
Mean-2.631237398
Median Absolute Deviation (MAD)0.18
Skewness-138.2539813
Sum-243718.364
Variance14789.21603
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
01706
 
1.8%
0.021639
 
1.8%
0.011598
 
1.7%
0.031492
 
1.6%
0.041489
 
1.6%
-0.011486
 
1.6%
0.061469
 
1.6%
0.071460
 
1.6%
0.051450
 
1.6%
0.081446
 
1.6%
Other values (3104)77390
83.3%
ValueCountFrequency (%)
-25968.521
< 0.1%
-137181
< 0.1%
-92421
< 0.1%
-8658.671
< 0.1%
-85561
< 0.1%
-63231
< 0.1%
-5807.751
< 0.1%
-43071
< 0.1%
-31691
< 0.1%
-30291
< 0.1%
ValueCountFrequency (%)
130
< 0.1%
0.9921
 
< 0.1%
0.9943
< 0.1%
0.9821
 
< 0.1%
0.9850
0.1%
0.9762
 
< 0.1%
0.9753
0.1%
0.9671
 
< 0.1%
0.9663
0.1%
0.9561
 
< 0.1%

Profitability
Real number (ℝ)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED

Distinct7040
Distinct (%)7.6%
Missing247
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean-29.53727365
Minimum-79682
Maximum140.58
Zeros823
Zeros (%)0.9%
Negative39657
Negative (%)42.7%
Memory size725.7 KiB

Quantile statistics

Minimum-79682
5-th percentile-13.63
Q1-0.64
median0.07
Q30.31
95-th percentile0.64
Maximum140.58
Range79822.58
Interquartile range (IQR)0.95

Descriptive statistics

Standard deviation677.2306671
Coefficient of variation (CV)-22.92800192
Kurtosis3709.939983
Mean-29.53727365
Median Absolute Deviation (MAD)0.32
Skewness-50.43091391
Sum-2735889.972
Variance458641.3765
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.141020
 
1.1%
0.111010
 
1.1%
0.1999
 
1.1%
0.13979
 
1.1%
0.16972
 
1.0%
0.09969
 
1.0%
0.12955
 
1.0%
0.17955
 
1.0%
0.19950
 
1.0%
0.15947
 
1.0%
Other values (7030)82869
89.2%
ValueCountFrequency (%)
-796821
< 0.1%
-561861
< 0.1%
-494721
< 0.1%
-34944.331
< 0.1%
-339151
< 0.1%
-327191
< 0.1%
-305761
< 0.1%
-30325.51
< 0.1%
-30015.751
< 0.1%
-293171
< 0.1%
ValueCountFrequency (%)
140.581
< 0.1%
5.741
< 0.1%
3.711
< 0.1%
2.661
< 0.1%
2.361
< 0.1%
2.251
< 0.1%
2.241
< 0.1%
1.982
< 0.1%
1.951
< 0.1%
1.9421
< 0.1%

Productivity
Real number (ℝ)

HIGH CORRELATION
HIGH CORRELATION
SKEWED
ZEROS

Distinct2823
Distinct (%)3.0%
Missing247
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean-1.22287051
Minimum-5093
Maximum1102
Zeros1581
Zeros (%)1.7%
Negative29653
Negative (%)31.9%
Memory size725.7 KiB

Quantile statistics

Minimum-5093
5-th percentile-1.18
Q1-0.06
median0.06
Q30.11
95-th percentile0.23
Maximum1102
Range6195
Interquartile range (IQR)0.17

Descriptive statistics

Standard deviation35.88555642
Coefficient of variation (CV)-29.34534451
Kurtosis8888.355102
Mean-1.22287051
Median Absolute Deviation (MAD)0.07
Skewness-80.9125844
Sum-113268.381
Variance1287.773159
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.074328
 
4.7%
0.084240
 
4.6%
0.064178
 
4.5%
0.094087
 
4.4%
0.13762
 
4.1%
0.053710
 
4.0%
0.113381
 
3.6%
0.043143
 
3.4%
0.122938
 
3.2%
0.032560
 
2.8%
Other values (2813)56298
60.6%
ValueCountFrequency (%)
-50931
< 0.1%
-45951
< 0.1%
-26401
< 0.1%
-25981
< 0.1%
-2442.171
< 0.1%
-23481
< 0.1%
-21361
< 0.1%
-21031
< 0.1%
-1681.51
< 0.1%
-1624.751
< 0.1%
ValueCountFrequency (%)
11021
< 0.1%
35.921
< 0.1%
17.711
< 0.1%
17.111
< 0.1%
10.71
< 0.1%
10.521
< 0.1%
7.721
< 0.1%
6.481
< 0.1%
5.281
< 0.1%
5.271
< 0.1%

Leverage Ratio
Real number (ℝ)

SKEWED
ZEROS

Distinct5297
Distinct (%)5.7%
Missing26
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean1.345316276
Minimum-7811
Maximum75970.38
Zeros17809
Zeros (%)19.2%
Negative7256
Negative (%)7.8%
Memory size725.7 KiB

Quantile statistics

Minimum-7811
5-th percentile-0.75
Q10
median0.28
Q30.82
95-th percentile2.75
Maximum75970.38
Range83781.38
Interquartile range (IQR)0.82

Descriptive statistics

Standard deviation253.0380927
Coefficient of variation (CV)188.088182
Kurtosis87527.86679
Mean1.345316276
Median Absolute Deviation (MAD)0.28
Skewness290.9228157
Sum124907.235
Variance64028.27634
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
017809
 
19.2%
0.011954
 
2.1%
0.021288
 
1.4%
0.031092
 
1.2%
0.04916
 
1.0%
0.05851
 
0.9%
0.06776
 
0.8%
0.08758
 
0.8%
0.07748
 
0.8%
0.1725
 
0.8%
Other values (5287)65929
71.0%
ValueCountFrequency (%)
-78111
< 0.1%
-7270.331
< 0.1%
-3231.41
< 0.1%
-1473.11
< 0.1%
-1470.971
< 0.1%
-1042.1051
< 0.1%
-776.591
< 0.1%
-676.761
< 0.1%
-633.841
< 0.1%
-558.521
< 0.1%
ValueCountFrequency (%)
75970.381
< 0.1%
3096.641
< 0.1%
2238.5671
< 0.1%
2183.591
< 0.1%
21671
< 0.1%
1726.91
< 0.1%
960.51
< 0.1%
936.31
< 0.1%
912.541
< 0.1%
863.131
< 0.1%

Asset Turnover
Real number (ℝ)

SKEWED
ZEROS

Distinct2987
Distinct (%)3.2%
Missing247
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean1.053613323
Minimum-31.59
Maximum276.38
Zeros5907
Zeros (%)6.4%
Negative15
Negative (%)< 0.1%
Memory size725.7 KiB

Quantile statistics

Minimum-31.59
5-th percentile0
Q10.39
median0.83
Q31.39
95-th percentile2.67
Maximum276.38
Range307.97
Interquartile range (IQR)1

Descriptive statistics

Standard deviation2.115945321
Coefficient of variation (CV)2.008275024
Kurtosis7656.622443
Mean1.053613323
Median Absolute Deviation (MAD)0.49
Skewness70.7239712
Sum97590.934
Variance4.4772246
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
05907
 
6.4%
0.01632
 
0.7%
0.54570
 
0.6%
0.63555
 
0.6%
0.49552
 
0.6%
0.38546
 
0.6%
0.46545
 
0.6%
0.42543
 
0.6%
0.67540
 
0.6%
0.5538
 
0.6%
Other values (2977)81697
88.0%
ValueCountFrequency (%)
-31.591
 
< 0.1%
-1.441
 
< 0.1%
-0.491
 
< 0.1%
-0.281
 
< 0.1%
-0.152
< 0.1%
-0.111
 
< 0.1%
-0.072
< 0.1%
-0.061
 
< 0.1%
-0.032
< 0.1%
-0.013
< 0.1%
ValueCountFrequency (%)
276.381
< 0.1%
243.671
< 0.1%
240.741
< 0.1%
182.211
< 0.1%
134.631
< 0.1%
102.081
< 0.1%
831
< 0.1%
81.51
< 0.1%
71.881
< 0.1%
63.51
< 0.1%

Operational Margin
Real number (ℝ)

HIGH CORRELATION
HIGH CORRELATION
MISSING
SKEWED
ZEROS

Distinct4511
Distinct (%)5.2%
Missing5557
Missing (%)6.0%
Infinite0
Infinite (%)0.0%
Mean-7.915485174
Minimum-30175.7
Maximum394.47
Zeros1360
Zeros (%)1.5%
Negative24522
Negative (%)26.4%
Memory size725.7 KiB

Quantile statistics

Minimum-30175.7
5-th percentile-4.25
Q1-0.03
median0.06
Q30.14
95-th percentile0.31
Maximum394.47
Range30570.17
Interquartile range (IQR)0.17

Descriptive statistics

Standard deviation214.4600791
Coefficient of variation (CV)-27.0937377
Kurtosis9073.757914
Mean-7.915485174
Median Absolute Deviation (MAD)0.08
Skewness-81.07270041
Sum-691140.588
Variance45993.12553
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.073582
 
3.9%
0.053341
 
3.6%
0.063330
 
3.6%
0.083292
 
3.5%
0.043267
 
3.5%
0.093167
 
3.4%
0.032984
 
3.2%
0.12975
 
3.2%
0.112723
 
2.9%
0.122494
 
2.7%
Other values (4501)56160
60.5%
(Missing)5557
 
6.0%
ValueCountFrequency (%)
-30175.71
< 0.1%
-282361
< 0.1%
-15265.31
< 0.1%
-130481
< 0.1%
-11741.51
< 0.1%
-115051
< 0.1%
-112781
< 0.1%
-10856.671
< 0.1%
-104631
< 0.1%
-102341
< 0.1%
ValueCountFrequency (%)
394.471
< 0.1%
193.141
< 0.1%
119.361
< 0.1%
85.941
< 0.1%
10.821
< 0.1%
10.811
< 0.1%
10.011
< 0.1%
9.291
< 0.1%
9.021
< 0.1%
2.8891
< 0.1%

Return on Equity
Real number (ℝ)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED
ZEROS

Distinct2997
Distinct (%)3.2%
Missing8
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean-2.11238201
Minimum-88875.14
Maximum39500
Zeros2355
Zeros (%)2.5%
Negative34186
Negative (%)36.8%
Memory size725.7 KiB

Quantile statistics

Minimum-88875.14
5-th percentile-0.95
Q1-0.08
median0.03
Q30.07
95-th percentile0.15
Maximum39500
Range128375.14
Interquartile range (IQR)0.15

Descriptive statistics

Standard deviation352.5969019
Coefficient of variation (CV)-166.919099
Kurtosis45744.33541
Mean-2.11238201
Median Absolute Deviation (MAD)0.05
Skewness-165.8537632
Sum-196164.243
Variance124324.5752
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.056876
 
7.4%
0.066328
 
6.8%
0.046268
 
6.7%
0.075152
 
5.5%
0.035060
 
5.4%
0.083853
 
4.1%
0.023759
 
4.0%
0.012882
 
3.1%
0.092868
 
3.1%
02355
 
2.5%
Other values (2987)47463
51.1%
ValueCountFrequency (%)
-88875.141
< 0.1%
-24848.481
< 0.1%
-24627.8061
< 0.1%
-14360.511
< 0.1%
-10261.361
< 0.1%
-8691.51
< 0.1%
-8415.581
< 0.1%
-7590.7351
< 0.1%
-6825.1431
< 0.1%
-5207.411
< 0.1%
ValueCountFrequency (%)
395001
< 0.1%
11228.851
< 0.1%
7468.651
< 0.1%
2137.431
< 0.1%
2082.771
< 0.1%
861.541
< 0.1%
813.811
< 0.1%
504.461
< 0.1%
483.4591
< 0.1%
445.11
< 0.1%

Market Book Ratio
Real number (ℝ)

Distinct49812
Distinct (%)53.7%
Missing57
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean358.5062651
Minimum-3151500
Maximum3455419.33
Zeros50
Zeros (%)0.1%
Negative8923
Negative (%)9.6%
Memory size725.7 KiB

Quantile statistics

Minimum-3151500
5-th percentile-92.4185
Q111.2
median58.28
Q3240.14
95-th percentile2170.874
Maximum3455419.33
Range6606919.33
Interquartile range (IQR)228.94

Descriptive statistics

Standard deviation26063.63798
Coefficient of variation (CV)72.70064854
Kurtosis8386.750539
Mean358.5062651
Median Absolute Deviation (MAD)56.36
Skewness11.32109506
Sum33274759
Variance679313224.8
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
050
 
0.1%
1.7226
 
< 0.1%
1.8125
 
< 0.1%
1.6525
 
< 0.1%
2.3424
 
< 0.1%
1.5824
 
< 0.1%
1.4423
 
< 0.1%
2.4523
 
< 0.1%
4.5623
 
< 0.1%
1.622
 
< 0.1%
Other values (49802)92550
99.7%
(Missing)57
 
0.1%
ValueCountFrequency (%)
-31515001
< 0.1%
-2306965.031
< 0.1%
-16961591
< 0.1%
-1136503.51
< 0.1%
-1076149.761
< 0.1%
-962043.591
< 0.1%
-8437501
< 0.1%
-8103001
< 0.1%
-804683.331
< 0.1%
-762128.841
< 0.1%
ValueCountFrequency (%)
3455419.331
< 0.1%
25454001
< 0.1%
2291347.51
< 0.1%
10395001
< 0.1%
1020718.451
< 0.1%
1003884.811
< 0.1%
857782.981
< 0.1%
818383.51
< 0.1%
7854961
< 0.1%
6050451
< 0.1%

Assets Growth
Real number (ℝ)

HIGH CORRELATION
MISSING
SKEWED

Distinct4650
Distinct (%)5.4%
Missing6701
Missing (%)7.2%
Infinite0
Infinite (%)0.0%
Mean1.294076824
Minimum-1
Maximum14231
Zeros438
Zeros (%)0.5%
Negative30845
Negative (%)33.2%
Memory size725.7 KiB

Quantile statistics

Minimum-1
5-th percentile-0.388
Q1-0.053
median0.052
Q30.192
95-th percentile1.037
Maximum14231
Range14232
Interquartile range (IQR)0.245

Descriptive statistics

Standard deviation73.76952173
Coefficient of variation (CV)57.00551958
Kurtosis22286.38885
Mean1.294076824
Median Absolute Deviation (MAD)0.119
Skewness137.2063306
Sum111511.894
Variance5441.942336
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0438
 
0.5%
0.006255
 
0.3%
0.023254
 
0.3%
0.039253
 
0.3%
0.033251
 
0.3%
0.013248
 
0.3%
0.071248
 
0.3%
0.045247
 
0.3%
0.036246
 
0.3%
0.052246
 
0.3%
Other values (4640)83485
89.9%
(Missing)6701
 
7.2%
ValueCountFrequency (%)
-1114
0.1%
-0.9998
 
< 0.1%
-0.9984
 
< 0.1%
-0.9978
 
< 0.1%
-0.9968
 
< 0.1%
-0.9955
 
< 0.1%
-0.9946
 
< 0.1%
-0.9939
 
< 0.1%
-0.9925
 
< 0.1%
-0.9913
 
< 0.1%
ValueCountFrequency (%)
142311
< 0.1%
10173.721
< 0.1%
78721
< 0.1%
50761
< 0.1%
47121
< 0.1%
2440.7781
< 0.1%
24051
< 0.1%
2182.4551
< 0.1%
2138.251
< 0.1%
1935.81
< 0.1%

Sales Growth
Real number (ℝ)

MISSING
SKEWED
ZEROS

Distinct4619
Distinct (%)5.4%
Missing6701
Missing (%)7.2%
Infinite0
Infinite (%)0.0%
Mean1.900109329
Minimum-27.431
Maximum39850
Zeros5018
Zeros (%)5.4%
Negative26340
Negative (%)28.4%
Memory size725.7 KiB

Quantile statistics

Minimum-27.431
5-th percentile-0.419
Q1-0.034
median0.06
Q30.204
95-th percentile0.967
Maximum39850
Range39877.431
Interquartile range (IQR)0.238

Descriptive statistics

Standard deviation177.632638
Coefficient of variation (CV)93.4854828
Kurtosis39255.81681
Mean1.900109329
Median Absolute Deviation (MAD)0.116
Skewness191.0078837
Sum163734.321
Variance31553.35409
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
05018
 
5.4%
-1699
 
0.8%
0.05234
 
0.3%
0.033224
 
0.2%
0.032223
 
0.2%
0.08222
 
0.2%
0.068221
 
0.2%
0.076220
 
0.2%
0.045220
 
0.2%
0.034219
 
0.2%
Other values (4609)78671
84.7%
(Missing)6701
 
7.2%
ValueCountFrequency (%)
-27.4311
< 0.1%
-9.2861
< 0.1%
-6.4881
< 0.1%
-3.7981
< 0.1%
-3.5621
< 0.1%
-3.1241
< 0.1%
-1.9231
< 0.1%
-1.8991
< 0.1%
-1.7831
< 0.1%
-1.6331
< 0.1%
ValueCountFrequency (%)
398501
< 0.1%
302451
< 0.1%
9326.51
< 0.1%
5887.2311
< 0.1%
52031
< 0.1%
3701.4671
< 0.1%
32131
< 0.1%
2996.6671
< 0.1%
2250.8421
< 0.1%
21671
< 0.1%

Employee Growth
Real number (ℝ)

HIGH CORRELATION
MISSING
SKEWED
ZEROS

Distinct3308
Distinct (%)3.9%
Missing7010
Missing (%)7.5%
Infinite0
Infinite (%)0.0%
Mean0.3433204444
Minimum-1
Maximum2699
Zeros7394
Zeros (%)8.0%
Negative31627
Negative (%)34.1%
Memory size725.7 KiB

Quantile statistics

Minimum-1
5-th percentile-0.333
Q1-0.048
median0.017
Q30.131
95-th percentile0.667
Maximum2699
Range2700
Interquartile range (IQR)0.179

Descriptive statistics

Standard deviation14.07415617
Coefficient of variation (CV)40.99422683
Kurtosis20454.00961
Mean0.3433204444
Median Absolute Deviation (MAD)0.087
Skewness128.4063963
Sum29478.18
Variance198.081872
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
07394
 
8.0%
0.2328
 
0.4%
0.25314
 
0.3%
0.333314
 
0.3%
0.024306
 
0.3%
0.071305
 
0.3%
0.038305
 
0.3%
0.1303
 
0.3%
0.029297
 
0.3%
0.045295
 
0.3%
Other values (3298)75701
81.5%
(Missing)7010
 
7.5%
ValueCountFrequency (%)
-1228
0.2%
-0.9993
 
< 0.1%
-0.9981
 
< 0.1%
-0.9975
 
< 0.1%
-0.9961
 
< 0.1%
-0.9944
 
< 0.1%
-0.9934
 
< 0.1%
-0.9926
 
< 0.1%
-0.9918
 
< 0.1%
-0.993
 
< 0.1%
ValueCountFrequency (%)
26991
< 0.1%
1671.51
< 0.1%
16001
< 0.1%
856.51
< 0.1%
818.2771
< 0.1%
6291
< 0.1%
5691
< 0.1%
549.3331
< 0.1%
4491
< 0.1%
414.6671
< 0.1%

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.

Sample

First rows

BKEPSLiquidityProfitabilityProductivityLeverage RatioAsset TurnoverOperational MarginReturn on EquityMarket Book RatioAssets GrowthSales GrowthEmployee Growth
001.580.360.180.131.331.770.070.152.22NaNNaNNaN
101.410.360.190.121.311.590.070.132.410.1260.0140.040
200.310.320.130.081.031.550.050.042.560.3680.3280.567
300.710.280.140.080.801.390.060.055.28-0.021-0.119-0.096
400.750.410.130.080.201.300.060.048.680.2330.1470.053
501.500.370.160.110.341.410.080.087.820.1320.2320.077
601.260.360.160.110.601.250.090.0520.510.2760.1330.189
701.460.370.180.120.361.270.100.0525.100.1860.2020.017
801.340.370.210.120.481.220.100.0542.050.2120.1660.099
901.550.360.220.130.561.140.110.0546.680.2510.1690.276

Last rows

BKEPSLiquidityProfitabilityProductivityLeverage RatioAsset TurnoverOperational MarginReturn on EquityMarket Book RatioAssets GrowthSales GrowthEmployee Growth
928620-0.0240.317-0.329-0.0970.0430.092-1.057-0.031159.570NaNNaNNaN
928630-0.0320.018-0.090-0.0082.6270.064-0.126-0.011481.7554.2432.6442.667
928640-0.903-0.038-1.158-0.916-3.9360.149-6.147-0.387-539.641-0.1311.0350.000
928650-0.358-0.065-0.140-0.0131.5490.137-0.092-0.176-311.77511.12410.1264.455
928660-1.469-0.013-0.565-0.2465.7690.365-0.674-8.159-8.924-0.3480.738-0.133
928670-1.488-0.015-0.759-0.057-1042.1050.174-0.327-6.614-1.847-0.073-0.557-0.077
928680-1.8080.094-1.205-0.121-4.5300.216-0.561-4.519-2.475-0.202-0.011-0.208
928690-0.0160.0390.000-0.0820.7450.254-0.324-0.5693274.506-0.168-0.020-0.105
928700-0.1330.054-0.0290.0010.5750.1960.005-0.08636.4750.077-0.171-0.059
928711-0.648-0.037-0.220-0.1450.6930.222-0.651-0.49855.624-0.0640.0650.063

Duplicate rows

Most frequently occurring

BKEPSLiquidityProfitabilityProductivityLeverage RatioAsset TurnoverOperational MarginReturn on EquityMarket Book RatioAssets GrowthSales GrowthEmployee Growth# duplicates
000.95-0.050.070.062.070.420.130.0629.370.5410.336-0.0022
100.99-0.050.070.091.330.700.130.0543.74-0.0100.640-0.0782
201.570.010.110.081.110.290.260.08187.14-0.009-0.001-0.0312
302.140.050.240.090.730.580.160.0933.26-0.0020.0050.0202
402.150.050.230.090.690.580.160.0933.350.036-0.0310.0022
502.270.060.230.120.650.610.200.1131.200.019-0.020-0.0022
602.280.000.120.101.040.330.290.08265.590.0320.072-0.0112
702.29-0.020.110.081.060.290.290.08257.600.154-0.006-0.0232
802.370.060.240.090.820.580.160.0840.080.0640.0590.0342
902.430.050.240.090.750.550.170.0843.690.0670.0180.0272